Can Automatic Personal Categorization deal with User Inconsistency?
Authors
Abstract
Document categorization is a daily task in every organization, but it is a very subjective process. While automatic document categorization has been widely studied, much challenging research remains to be done on supporting user-subjective categorization. This study evaluates and compares the application of Self-Organizing Maps (SOM) and Learning Vector Quantization (LVQ) to automatic document classification according to a subjectively predefined set of clusters in a specific domain, and assesses the effect of user inconsistency on this process. Results show that despite the subjective and inconsistent nature of human categorization, automatic document clustering methods correlate well with subjective, personal clustering. Moreover, adapting a system to its users is limited by the users' inconsistency, meaning that perfect adaptation is an impractical goal.
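As a rough illustration of the kind of supervised prototype learning that LVQ performs, the sketch below implements the basic LVQ1 update rule on toy data. It is not the study's implementation: the two-dimensional document vectors, the category names, the initial prototypes, the learning rate, and all function names are hypothetical and chosen only to keep the example small.

```python
import random

def nearest(prototypes, x):
    """Index of the prototype closest to x (squared Euclidean distance)."""
    return min(
        range(len(prototypes)),
        key=lambda i: sum((p - v) ** 2 for p, v in zip(prototypes[i][0], x)),
    )

def train_lvq1(samples, labels, prototypes, lr=0.3, epochs=20, seed=0):
    """Basic LVQ1: move the winning prototype toward a sample when their
    labels agree, and away from it when they disagree."""
    rng = random.Random(seed)
    protos = [(list(w), c) for w, c in prototypes]  # copy, keep input intact
    order = list(range(len(samples)))
    for epoch in range(epochs):
        rate = lr * (1 - epoch / epochs)  # linearly decaying learning rate
        rng.shuffle(order)
        for i in order:
            x, y = samples[i], labels[i]
            w, c = protos[nearest(protos, x)]
            sign = 1.0 if c == y else -1.0
            for d in range(len(w)):
                w[d] += sign * rate * (x[d] - w[d])
    return protos

def classify(protos, x):
    """Label of the prototype nearest to x."""
    return protos[nearest(protos, x)][1]

# Hypothetical toy data: each document is a 2-D vector of term weights,
# labelled with the category one user assigned to it.
docs = [[0.9, 0.1], [0.8, 0.2], [0.1, 0.9], [0.2, 0.8]]
user_labels = ["finance", "finance", "sports", "sports"]
initial = [([0.6, 0.4], "finance"), ([0.4, 0.6], "sports")]

trained = train_lvq1(docs, user_labels, initial)
print(classify(trained, [0.85, 0.15]))
print(classify(trained, [0.15, 0.85]))
```

In this setting, user inconsistency would correspond to the same or near-identical vectors carrying different labels at different times, which pulls and pushes the same prototype in opposite directions and bounds how closely any such model can match the user.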
Similar resources
Automating Personal Categorization Using Artificial Neural Networks
Organizations as well as personal users invest a great deal of time in assigning documents they read or write to categories. Automatic document classification that matches user subjective classification is widely used, but much challenging research still remains to be done. The self-organizing map (SOM) is an artificial neural network (ANN) that is mathematically characterized by transforming hi...
Supporting user-subjective categorization with self-organizing maps and learning vector quantization
we requested the user to reclassify documents that were misclassified by the system. Results show that despite the subjective nature of human categorization, automatic document categorization methods correlate well with subjective, personal categorization, and the LVQ method outperforms the SOM. The reclassification process revealed an interesting pattern: About 40% of the documents were classi...
A Tool for Individualizing the Web
The increasing complexity of navigating the Internet is becoming one of the fundamental obstacles to its effective use. This is due to the nature of the Internet, principally, a disorganized collection of both sites and site documents whose exponential growth rate is rapidly outstripping any user's ability to master it. There are two ways to deal with this complexity: reorganize the structure of...
An Interface to Retrieve Personal Memories Using an Iconic Visual Language
Abstract: Relevant past events can be remembered when visualizing related pictures. The main difficulty is how to find these photos in a large personal collection. Query definition and image annotation are key issues to overcome this problem. The former is relevant due to the diversity of the clues provided by our memory when recovering a past moment and the latter because images need to be anno...